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Abstract — We investigate the decoding region for Algebraic 
Soft-Decision Decoding (ASD) of Reed-Solomon codes in a 
discrete, memoryless, additive-noise channel. An expression is 
derived for the error correction radius within which the soft- 
decision decoder produces a list that contains the transmitted 
codeword. The error radius for ASD is shown to be larger 
than that of Guruswami-Sudan (GS) hard-decision decoding for 
a subset of low-rate codes. These results are also extended to 
multivariable interpolation in the sense of Parvaresh and Vardy. 
An upper bound is then presented for ASD's probability of error, 
where an error is denned as the event that the decoder selects 
an erroneous codeword from its list. This new definition gives a 
more accurate bound on the probability of error of ASD than 
the results available in the literature. 



I. Introduction 

Reed-Solomon (RS) codes are used in a wide variety of ap- 
plications currently, and the classical algorithm of Berlekamp 
and Massey (BM algorithm) has been employed for decoding 
in most cases. For an RS code of length n and dimension 
k, this algorithm is guaranteed to recover the transmitted 
codeword within an error r adius of [^^-\. 

Guruswami and Sudan [7] presented an important new 
algebraic decoding method for RS codes that is able to correct 
errors beyond the BM decoding radius. This method involves 
constructing a bivariate polynomial with zeros of multiplicity 
based on the received symbols. The polynomial can then be 
factored to give a list of candidate codewords; thus, it is a 
list decoder. A Guruswami-Sudan (GS) decoder includes the 
transmitted codeword on its output list if the errors fall within 
a radius of n — \fnk. 

In [7], the authors mention that their hard-decision algebraic 
decoding technique can be extended to soft-decision decoding 
by setting the values of the multiplicities based on channel 
posterior probabilities and not received symbols. However, [7] 
does not provide a way of assigning these multiplicities, which 
turns out to be a nontrivial component in the RS decoding 
procedures. Koetter and Vardy refined the Algebraic Soft- 
Decision Decoding (ASD) approach in [11] by providing an 
algorithm that converts a matrix of posterior probabilities II 
to a multiplicity assignment matrix M.. 

Various papers, such as [13], [3], [8], have been published 
since [11] that propose using a different method for converting 
II to M.. However, few papers have given insight into the 
decoding region of ASD. One exception is [8], which charac- 
terizes the decoding region for medium to high rate codes over 
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binary erasure and binary symmetric channels. Another paper 
[10] derives an error correction radius for an arbitrary additive 
cost function associated with transitions in the channel. 

This paper examines the performance of ASD when the 
noise is additive, i.e., there is a probability distribution p(a) on 
q-ary errors that does not depend on the transmitted sequence. 
Relying on this distribution, we derive in Sect. [Ill] simple 
estimates of the error radius within which the transmitted 
codeword is guaranteed to be on the list produced by ASD. 
With the simple expression for the error radius obtained, 
we are also able to characterize the region where the ASD 
algorithm provides an improvement over the GS decoding 
radius. 

In Section |IV| we study bounds on the probability of error 
for ASD decoding. Prior work [15] has concentrated on the 
list-decoding error event, namely the event that the transmitted 
codeword fails to be included in the list. We point out that 
for low code rates, the list decoding error criterion does not 
provide insight into the performance of the decoder because 
the transmitted codeword always will be included in the list. 
We therefore define decoding to be successful if the ASD 
algorithm selects the transmitted codeword from the decoder's 
list, and we derive an upper bound for the probability of 
error based on this new definition. Finally, in Section [V] we 
briefly discuss soft-decision decoding of multivariate RS codes 
introduced in a recent work of Parvaresh and Vardy [14] and 
extend to this case our estimate of the ASD error radius. 

The error bounds obtained in Sections iHflVl are either the 
first of their kind available in the literature, or, as in the case of 
the ASD list-decoding error bound, improve the results known 
previously. 



II. Decoding of RS codes 



A. Notation 



Let q be a prime power, let ¥ q = {a\ = 0, a.2, ■ ■ ■ , a q } 
be the finite field of q elements, and let n = q — 1 be the 
code length. Denote dist(-, •) as the Hamming distance. With 
a vector v = (v\, . . . ,v n ) € F™ we associate an integer- valued 
function v(i) whose value is j if Vi = aj. 

For a polynomial / e ¥[x] define the evaluation mapping 
eval : / — > F™ given by (eval/)^ = f(ai), 2 < i < q. Thus, 
the evaluation mapping associates a q-ary n-vector to every 
polynomial / <E F g [x]. 

Definition 1: A q-ary RS code C of length n = q — 1 and 
dimension k is the set of codewords of the form 

{c = eval(/): f e¥ q [x},0<degf <k-l}. 

To describe the encoding of the code C, suppose that the 
message to be transmitted is u — (ui,U2, ■ ■ ■ , Ufe) where u, £ 



¥ q , 1 < i < k. The codeword that corresponds to it is given 
by c = eval(/), where the polynomial / has the form 

f(X) = Ul + u 2 X + u 3 X 2 + ■■■ + u k X k - x . 

We assume that the codeword c is transmitted over a discrete 
memoryless channel. In the hard-decision case, the output of 



the channel is the vector y 



e. Let Wi 



a»|c = otj) be the probability that the symbol aj transmitted 
over the channel is received as a*. We will assume that the 
noise is additive, i.e. there exists a probability distribution p 
on F q such that Pr(a + e\a) = p(e),a,e £ ¥ q . Note that 
under these assumptions, the channel is symmetric as defined 
in Section 8.2 of [2]. 

In the setting of soft-decision decoding, the demodulator is 
assumed to provide the decoder with the posterior probabilities 
conditioned on the received (continuous) signal. However, the 
task of analyzing this setting in the context of algebraic list 
decoding so far has proved elusive. We will therefore assume 
that the receiver outputs in each position of the codeword, a 
hard-decision symbol (for instance, the most likely transmitted 
symbol) together with the q values of posterior probabilities 
TTij = Pr(c = cti\y = aj). As customary in the literature, we 
will assume that ASD takes the channel's output to be in the 
form of a q x n matrix II = [iTij]. 

This paper concentrates on the ASD algorithm of [11] and 
compares its performance to the well-known hard-decision 
decoding algorithms of Berlekamp and Massey (see, e.g., [1]) 
and Guruswami and Sudan [7], [12]. 



B. Hard-Decision Decoding Methods 

Under BM decoding, if the number of errors t = dist(c, y) 
satisfies 

n-k+1 



t < 



(1) 



then the decoder will output c. If condition ([]]) is not true, then 
decoding is guaranteed to fail. Therefore, (Q]) is a necessary 
condition for BM decoding success. 

GS decoding produces a list that contains all the codewords 
of a certain distance t m from the vector y and potentially 
some codewords outside of this Hamming ball. List decoding 
success is declared if the correct codeword is on the list. The 
distance t m is determined by m which is a parameter of the 
algorithm. As m increases, t m increases to an asymptotic limit 
given in Lemma Q] 

Lemma 1: (Guruswami and Sudan [7]) Let m — > oo. Let c 
be a codeword that satisfies 



dist(y. c) < n 



'nk. 



(2) 



Then c will be included in the list output by the GS decoder 
with input y. 

The complexity of the algorithm often becomes a limiting 
factor before the maximum possible t m is achieved. Note that 
(f2j) is only a sufficient condition on GS list-decoding success: 
the decoding is guaranteed to have the transmitted codeword 
on the list if the error pattern satisfies (O, assuming large m. 



Let r = t/n be the normalized error correction radius of RS 
decoding algorithms, and let R — k /n be the rate of the code. 
We have 



t = (BM Decoding) 

2 5 (3) 

r = 1 — VH (GS Decoding, m large). 

The two error radii given in @ are shown as the dashed curves 
in Figure 1(a). The GS decoding radius is always greater than 
its BM counterpart although the difference becomes small 
for high rates. However, no error radius for Algebraic Soft- 
Decision Decoding was previously known. 

C. ASD Algorithms 

ASD is an extension of GS decoding by the manipulation of 
the multiplicities. We only mention the salient features of this 
algorithm, referring to [11] for the details. Instead of operating 
on a vector y, ASD takes as input a multiplicity matrix M. 
of dimensions q x n constructed on the basis of the posterior 
probability matrix n. The soft-decision decoder constructs a 
bivariate polynomial Q(X,Y) that has zeros of multiplicity 
set by M.. In the next step, the decoder recovers a list C of 
putative transmitted codewords from the F-zeros of Q(X, Y). 
In contrast to GS decoding that always constructs Q(X,Y) 
based on n distinct zeros, ASD can have up to qn distinct 
zeros. 

Definition 2: Define the Score of a vector v = 
(v\, . . . ,v n ) £ F™ with respect to a multiplicity matrix M. 
to be 



1=1 



where v(i) = £ if Vi = ag. 

If needed, in the last stage the decoder chooses the codeword 
c from the list C with the largest score or the codeword with 
the largest probability P n (c\y) — Y\^ Ci .yi- 

1) The Multiplicity Matrix: The matrix M. is determined 
from the matrix of posterior probabilities II. Koetter and Vardy 
[11], Parvaresh and Vardy [13], and El-Khamy and McEliece 
[3] have proposed various methods for determining M. from 
n. A simple method for converting II to M. that will be used in 
this paper is called the Proportionality Multiplicity Assignment 
Strategy (PMAS) proposed by Gross et al. [5]. PMAS finds 
M. by performing the following element-wise calculation on 
n for some fixed number A : 



[ATTij-J, i = !,...,!; j 



.,71. 



(4) 



Thus, the matrix M. is determined uniquely from the received 
vector y and the properties of the communication channel. The 
parameter A £ "L + is the complexity factor, and its adjustment 
controls directly the balance between the performance and 
the complexity of ASD. Another important measure of the 
complexity of the decoder is the cost of the multiplicity matrix, 
defined as follows. 

Definition 3: Let the Cost of a multiplicity matrix M. be 



2) Threshold Condition: GS decoding includes on its out- 
put list all codewords in the Hamming space within a certain 
radius of the received vector y, but ASD has no known 
geometric interpretation. A sufficient condition for ASD's 
success is determined indirectly by the vector y and can be 
stated in terms of the score Sm ( c ) as given in the next lemma. 

Lemma 2: [11] Suppose ASD is used to decode a received 
vector y with the RS code. If 



S M (c) > y/2(k-l)C(M) 



or equivalently 



^2 m c(i),i > (k - l)^2m t ^(m hj + 1), 
»'=1 y i,0 

then the transmitted codeword is on the decoder's list. 

Lemma [2] can be used to evaluate ASD's performance in 
the case that c is transmitted, y is received, and M. is the 
multiplicity matrix. 

3) Size of the List: The decoder's list C cannot exceed 
the F-degree of Q(X,Y) since C is obtained by factoring 
Q(X, Y) for the F-roots. The size of the list in terms of the 
F-degree L of Q is estimated in the next lemma, which is 
due to McEliece [12], Corollary 5.14. 

Lemma 3: For k > 2, the number of codewords on the list 
of an algebraic soft-decision decoder does not exceed 



L 



'2C(M) fk + 1 



k-1 



\2k-2 



fc + 1 
2k- 2 



The condition \C\ < L is met with equality if and only if all 
the F-roots have degree less than k. 

III. ASD Error Correction Performance 

In this section, we present one of our main results, an 
estimate of the error correction radius t of the algorithm. We 
would like to stress one essential difference of the result below 
from the other similar results in the literature. In the case of 
BM and GS decoding for instance, all of the codewords within 
the error radius are included in the list output by the decoder. 
In contrast, we only guarantee in the case of ASD that if the 
transmitted codeword is distance t away from the received one, 
then it will be included on decoder's list. Other codewords, 
even within the sphere of radius t from y, may escape being 
output by the decoder. In other words, the decoding regions of 
ASD are far from being spherical, and in fact, no geometric 
characterization of them is available. 

A. Setting 

Following Koetter and Vardy in [11] and Justesen in [9], 
we assume that each symbol entering the channel is uniformly 
drawn from ¥ q . It follows that 7Tj j = Wij. Since the channel is 
symmetric, we know that the channel transition probabilities 
Wi j are drawn from the set {pi,P2, ■ ■ ■ ,Pq}- Next, we will 
introduce the three channel statistics 



When the channel is noiseless, set p m i n = 0. We will assume 
throughout that the channel's capacity is greater than zero, 
giving us p max > p min . As will be seen later, p max , p min , and 
7 will be the only channel statistics necessary in our analysis 
of ASD's performance. 

B. Error Radius 

Theorem 1: Suppose that an RS code with rate R = k /n is 
used to communicate over an discrete, additive-noise channel. 
Suppose that a codeword c is transmitted and an algebraic soft- 
decision decoder with complexity factor A is used to decode 
the received vector y. Let t — dist(c, y). If 



- < 

n 



(5) 



then c will be contained in the output list of the decoder. 

Proof: Let c be the transmitted codeword and let y be 
the received vector. Substituting in Lemma we get 



E"=l LM:(i),y(i)J 



> Vk- 1. 



ELi£^(LA^-J 2 + LA^J) 



Instead of this condition for successful decoding let us use a 
more stringent one obtained by removing the integer parts: 



E"=l {^ W c(i),y(i) ~ 1) 



> y/k^l. (6) 



TILi£j = i((A^) 2 + Au^) 
Rearranging and using the channel statistics 7 yields 



1 = 1 



7 1 1 

n Xn A 



Inequality (|7]l certainly holds true for the codeword c if 
1 



i=l 



1 



1 



X A 



(7) 



(8) 



We have derived a condition based on a specific y, but we 
are interested in ASD's performance for any y when c is 
transmitted. Thus, y becomes a random variable. Let Wi be a 
random transition probability w c (j).y(i) given random y. Note 
that the Wi& do not depend on c because of the additive noise 
assumption. The p.m.f. of W% is as follows: 

PWiipi) ■= Pr{VK; = pi} = p h i = l,2,...,q. 

We can now rewrite © as follows: 

1 




Since the received vector y differs from c in f coordinates, 
we can bound the left-hand side of the last inequality below 
as follows: 



Wi > t P 

min 

+ (n — t)p 

n 



max pi 

l<i<q 



mm pi 

l<i<q 
i:pi>0 



Erf- 

»=i 



Indeed, if ^ c; then the probability w c u\ v u\ > p m in- 
On the other hand, if there is no error in coordinate i then 



the probability w c (i),y(i) — Pmax! otherwise, communication 
over the channel would be impossible. Thus, c is on the soft- 
decision decoder's list if 



1 



(tPn 



t)Pn 




The theorem follows. □ 
A first look at the error radius in Theorem Q] reveals that © 
becomes large as p mm approaches p max - In other words, ASD 
performs well when the channel is far from g-ary symmetric. 
If the channel is noiseless and A is sufficiently large, then 
the bound in Theorem Q] reduces to 1 — y/R which is the GS 
normalized error bound. Let us consider a few examples. 

Example 1: Consider the "typewriter channel" where 
Wi t i — 0.8, Wij — 0.2 for some j ^ i, and Wij — for 
all the remaining pairs Thus, p max = 0.8, p m in — 0.2, 

and 7 = 0.68. Let us set A = 100. Figure 1(a) shows the 
normalized error correction radius compared to the GS error 
bound for this example. The BM error bound is also shown 
for reference. ASD is able to produce a list with the codeword 
c for a greater error radius than GS decoding for many low 
to medium rates. The range of rates for which ASD decoding 
corrects more errors than GS decoding is characterized below 
in this section. 

Example 2: Next, let us look at the error radius of Theorem 
[T]when there are two possible, equiprobable errors per symbol 
transmission, termed the "two-error channel." In this case, 
Pmax = 0.8, pmin = 0.1, 7 — 0.66, and we again set A = 100. 
Figure 1(b) shows the ASD, GS, and BM curves for this 
example. 

Example 3: Figure 1(c) shows the normalized error bound 
compared to the GS error bound for a g-ary symmetric channel 
with p m ax = 0.805 and q — 16. Thus, p m in = 0.013 and 
7 = 0.6506, and we set A = 100. ASD still provides an 
improvement, but it is only for extremely low-rate codes. 
The lack of improvement for the g-ary symmetric channel is 
expected since all error patterns, given that there are t errors, 
are equally probable. 

As the channel approaches g-ary symmetric, ASD is shown 
to have a progressively smaller improvement over GS decod- 
ing for low-rate codes. The fact that the bound on the error 
radius for ASD is below the GS radius for higher rates is due 
to to the approximations used in the proof of Theorem Q] The 
true ASD error radius is likely to be slightly greater or within 
a neighborhood of the GS decoding radius for all code rates. 

In [10], Koetter considers a general transmission scenario 
when an RS or related code is used for communication 
over a symmetric channel, and the decoder's performance 
is measured in terms of an arbitrary additive cost function 



A : F„ 



[10] attempts at optimizing the assignment 



of multiplicities for the purpose of finding the maximum 
attainable decoding radius. An error vector e € F™ is assigned 
the cost 



A(e) 



;A( ei ). 



The result of [10] stated for an additive channel, is as follows 
(the original text contains some misprints). 



Theorem 2: Suppose that a code vector c of an RS code of 
rate R transmitted over the channel is received as y = c+e. If 
the values R and A(e) simultaneously satisfy the inequalities 



R< (l-e) £>-0A(a)ft (e>0) (9) 

a£W q 

A(e)< A(a)[p-0A(a)]+ (10) 

aGF, 

where [x] + = max(0,:r), 6 > is an arbitrary parameter, and 
£>-0A(a)] + = l, 

aGF, 

then the vector c will be placed on the ASD output list. 
Moreover, relations d9b-(TT0l) characterize the optimum tradeoff 
between the code rate and the cost of error. 

We note that even though the theorem's result is optimum 
for the "error radius" in terms of the cost function, its result 
is inferior to the result of Theorem [T] in the following sense. 
For the previous theorem to give a performance curve over 
Hamming distance, the decoder is optimized over Hamming 
distance, a concept that is congruent with a hard-decision de- 
coder, not a soft-decision decoder. Namely, taking the distance 
as the cost function, we have A(ct;) = 1 — <5 Qi ,o, i = 1, ■ . . ,q 
(the cost is 1 for all the symbols of ¥ q except for some 
distinguished symbol whose value is of no importance). Then 
(191- ([Toll reduce to the equation 



t 

- < 



it 



nR 



ii 



n 

1 - e 



n - 



n - 



1 



1 y(n + l)(l-e) (n + l) 2 ' 
<R<l-e 



which is an error radius that slightly exceeds the GS bound 
d2J for most rates. As expected, it is inferior to (0 for a subset 
of low to medium rate codes. 



C. Size of the List 

Proposition 1: Assume that k > 2. Given an additive noise 
channel, the size of the list for an algebraic soft-decision 
decoder is bounded above by 



' n(A 2 7 + A) 
k - 1 



1 



2fc-2 



1 



k-2 



(11) 



Proof: Assuming PMAS ©, observe that 



2C(M) = n 



ELApd 2 

i=l 



i=l 



Substituting this upper bound in Lemma [3] gives the result. □ 
The bound in Proposition Q] is based on the F-degree of the 
polynomial Q(X,Y). Thus, it includes polynomials that are 
a good fit for the set of multiplicity points, including higher- 
degree polynomials. As a result, the bound of Proposition [T] 
is not tight. 

Example 4: Figure 2 shows a graph of the bound on the list 
size presented in Proposition Q] for Example Q] with n = 255. 
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Decoding radius of ASD compared to GS and BM decoding for a), the "typewriter channel" of Example 1, b), "two-error channel" of Example 2, 
the q-ary symmetric channel of Example 3. 
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Fig. 2. List size using ASD for the "typewriter channel". 



Since the bound in (fTTI) is a strictly decreasing function of 
the code's dimension k, an upper bound on the size of the list 
is obtained by taking k = 2 : 



LCI < 



nA 2 7 - 



x 9 3 



< V«(A 2 7 + A). 



Thus, the number of codewords on the list is bounded above 
by a slowly growing function of n. 



For the <?-ary symmetric channel, a case where intuition 
tells us that ASD should provide no improvement over GS 
decoding, Figure 1(c) still shows a region that where ASD's 
error radius exceeds GS decoding 's error radius. However, 
there is no code construction that simultaneously satisfies 
the condition A > l/p mm and ( fT2b for the g-ary symmetric 
channel unless we have A — > oo. Thus, there is no achievable 
rate region where the ASD radius is larger than the GS 
decoding radius except when the size of the list is unbounded. 



D. A Closer Look at the ASD Error Radius 

We are interested in quantifying when the error radius of 
ASD exceeds the error radius of GS decoding. For simplicity 
we only analyze the case of GS decoding with m — » oo (note 
that it will also imply that ASD is better than GS decoding 
for any finite to). 

Corollary 1: With A > 1 /p min , the algebraic soft-decoding 
radius exceeds the GS decoding radius if 



Pu 



1/A 



Proof: 



R< , _ 

mm 

Solving the equation 



(12) 



y/R ( 7 + 1/A) -1/A 



Pn 



Pu 



>i-Vr 



for R and assuming A > 1 /p mia yields the corollary. □ 
Corollary Q] gives a sufficient condition for the ASD correc- 
tion radius to exceed the GS decoding radius. This condition 
is nontrivial for a subset of low code rates and A large enough 
(see Example [T). One also notices in Example Q] that there is 
another non-zero subset of code rates where the transmitted 
codeword is always on the list, i.e. t /n < 1. Corollary [2] 
quantifies that region. 

Corollary 2: Let A > 1 / PmlIl . If 



R < 



(Pn 



then an algebraic soft-decision decoder will always produce a 
list containing the transmitted codeword c. 

Proof: If the right-hand side of © is 1, then ASD will 
produce a list that contains c regardless of the error pattern. 
Thus, the claim reduces to solving for R the inequality 



Pn 



Pn 



> 1, 



from which the corollary follows. □ 
It is a surprising result that there exists non-zero rates where 
ASD always produces a list polynomial in n that contains the 
transmitted codeword. The intuition is that Pr(y\c) ^ for all 
channels, but for many codewords c' S C, Pr(y\c') = due 
to zeros in the transition probability matrix. The zeros in the 
transition probability matrix allow the soft-decision decoder 
to discount codewords. When the code rate is low enough, 
the decoder can reduce the size of the list to be within the 
bound of Proposition Q] without actually eliminating possible 
codewords. In this case, the probability of list-decoding error 



IV. ASD Probability of Error Bounds 

This section is focused on bounding the probability of 
error for ASD. In the list-decoding setting, the probability 
of error has generally been defined as the probability that 
the transmitted codeword is not on the decoder's list. In the 
final stage of decoding, it may be required to select a unique 
codeword candidate from the list obtained using the maximum 
likelihood criterion. In this section, we derive a bound for the 
probability of error when the correct decoding corresponds to 
the case that the transmitted codeword is on the decoder's list 
and it is selected as the best estimate. 

The only previous work that deals with deriving bounds on 
the probability of error for ASD is [15]. In that paper, Ratnakar 
and Koetter consider a general channel and use list-decoding 
error as the error event. We refine the results of [15] in two 
ways: first, we prove a tighter bound on the list-decoding error 
probability for low rates, and secondly, we derive a bound on 
the probability of selection error that is the dominating error 
event for those rates. 

A. General Form 

As in the previous section, the channel is assumed to be 
additive and memoryless. The transmitted codeword is c, the 
decoder's list is C, and the decoder's chosen codeword is c. 
Define the following random events A and B as 

A : c <£ C, B:c^c. 

The list-decoding probability of error, which is the probabil- 
ity that the list produced by ASD contains the transmitted 
codeword, is given by P{A}. The selection probability of 
error is given by P{B}. Observe that A C B, giving us 
P{„4} < P{B}. Each of these probabilities can be bounded 
above by using the Chernoff bound (see e.g. [4]). 

Lemma 4: (Chernoff bound) Let w be a random variable 
with moment generating function $ w (s) and let A be a real 
number. Then 



Pr{w >A}< e~ sA <f> w (s), s > 



(13) 



Pr{w <A}< e sA <f> w (-s), s > 0. (14) 
In applications, one optimizes on the choice of s to obtain 
the tightest bound possible. The Chernoff bound will allow us 
to write 



P{A} < e 



-nE A 



P{B} < e~ nEe . 



The functions Ej, and E® are the error exponents for -P{-4} 
and P{B}, respectively. 



B. List-Decoding Probability of Error 

The next theorem gives an upper bound on the probability 
that the transmitted codeword is not on the list output by the 
decoder. A similar estimate of the error exponent, in a more 
general context, is derived in [15]. However [15] does not 
contain the analysis of the error radius that leads us to conclude 
that for low rates the transmitted codeword will always be on 
the list. In other words, the error event for these (and some 
other) rates will be dominated by an incorrect selection from 
the list obtained through ASD. 

Theorem 3: The probability of event A can be bounded 
above as 

P{A} < e- nEA , 



where 



E A = - 



q 

my^e 



except when R < ^^7^ and A > 1 /p Ynia , in which case 
Ea = 00. 

Proof: The rate region where Ea = 00 follows directly 
from Corollary Q] In order to gain insight into the event A 
for the remainder of the rates, consider again the threshold 
condition for ASD list-decoding success given in Lemma |2] 
If the condition in Lemma [2] is met, it is clear that A is false. 
However, if the condition is not met, A could either be true 
or false. Thus, for a given M., we have 

P{A) < Pr {s M (c) < y/2(k-l)C(M)} . 

Assume now that the received vector y is a random vector with 
coordinates distributed according to the transition probabilities 
in the channel. As above, let W% be a random transition 
probability w c u\yu\. With these assumptions, the components 
of the matrix of multiplicities as well as the score of a 
codeword c and the cost of M. become random variables. 
Then 



Pv{S M (c) <y/2(k-l)C(M)} < 



Finally, the Chernoff bound ( fT~4T > can be applied to give the 
result 



P{A} < e s 



»(V*(t+x)+x) 



-sp, 



where s > 0. The theorem follows. □ 
In order to obtain the tightest bound, we need to maximize 
Ea through proper choice of s. If we define g(s) by Ea = 
— ln(g(s)), then the goal is to minimize g(s). When the 
maximum value of Ea is negative, then no conclusion can 
be drawn regarding the possibility of reliable communication. 
The following lemma shows that reliable communication is 
possible when the code rate is less than a rate maximum that 
can be well-estimated by 7. 



Lemma 5: 

E A >0 if R < 
Proof: We have that 



(7-1 A) 2 
7 + 1/A 



g(s) = E Pie 



-«(p<-V«(T+i)-*)_ 



First observe that g"(s) > 0, indicating that the function g(s) 
is convex. Next observe that g(0) = 1 and that g'(0) < 
if and only if R < (7 - l/A) 2 /(7 + 1/A). In the case that 
g'(0) < 0, the minimum value of g(s) is achieved for some 
s' > 0, and since g(0) = 1, g(s') < 1. Thus, E A > 0. 
Otherwise, g(s) > 1 which gives a trivial estimate Ea = 0. 

□ 

C. Selection Probability of Error 

When the list-decoding probability of error is zero as for 
the rate region defined in Corollary |2] one does not have 
insight into the performance of ASD. Therefore, it is of interest 
to quantify a comprehensive probability of error given by 
P{B}. To make the selection error probability amenable to 
analysis we consider the ensemble of random Generalized 
Reed-Solomon (GRS) codes. 

Let C be an RS code and w = (wi , . . . , w n ) be a vector of 
nonzero elements of F g . A GRSfc (w) code is obtained from 
C be multiplying every code vector c € C by a diagonal 
matrix diag (wi, . . . , w n ). Thus, the code C gives rise to the 
ensemble of (q — l) n GRSfe codes with uniform distribution 
on it induced by the choice of the modifying vector w. 

We will assume transmission with a random GRS code 
where w is chosen uniformly before each transmission. 

Theorem 4: Suppose that a random GRS code G of rate R 
is used to transmit through an additive channel. The decoding 
error probability of ASD can be bounded above as 

P{B} < e' nEt3 



where 

E B 



In 



g*-i( e '/A 



-*(p<-p*— t) n 



1 1 

i=i j=i 

Proof: Let {ci, . . . , cm} be the codewords of the code 
G. Define the events C, and T>i as follows: 

d = {(c t e £) & (S M (ci) > S M (c))} 

Vi = {S M ( Cl ) > S M (c)} 

where C is the decoder's list of codewords, c is the transmitted 
codeword, and M. is the multiplicity matrix. 
Observe that 

P{B} = Pr{3c' ? c, d e C : S M (d) > S M {c)} 

= p { U c 4- £ P ^ 

i=l i=l 



< E p {^>- 
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Fig. 3. Comparison of the two error exponents for ASD over low rates. 



For all Vi e F™, define the event as follows: 



We have 



Si : {S M {vi) > S M (c)}. 



P{Vi}= E P{Si,VieG}. 



4=1 



»=1 



Further, every coordinate of c is distributed uniformly in F q 
and therefore, the multiplicity m 1 ,( l ) i is a uniform random 
variable taking values in {pi, . . . ,p q }. Moreover, Pi(vi E 
G)=q k ~ n . 

We then know that Sm( v ) is a sum of i.i.d. random 
variables m„(iW. As a result, -P{£i} is the same for all Vi ^ c, 
and the events £ 4 and {vi S C} are independent, i.e. 



i=l 



Define the random variable Wi as in the proof of Theorem Q] 
and let Vi,i = 1, . . . , q n be i.i.d. r.v.'s with 



Then 



p(v$ = p 3 = i/«,i< j <«. 

P{£} < q k P{£ i \v i ± c] 



IL 74 

= 9 fc Pr{^LAV^J >EWJ} 

n 

< 9 fc Pr{^(y,-^)>-5}- 

i— 1 

Now let us apply the Chernoff bound ( TT3T > - For any s > 

Pr{£(V, -«',)>—} 

±H"( 



i=i 



Algebraic manipulations and the appropriate definition of Eg 
yield the theorem. □ 

It is of interest to find out the behavior of Ea compared 
to Eb across all values of R. Let us compare the two error 
exponents in an example. 

Example 5: Consider once again Example Q] (p max = 0.8, 
p min = 0.2, A = 100, q = 256, and 7 = 0.68). Figure 3 
shows the comparison of Ej, to Eg. ne can see that reliable 
communication, in the list-decoding sense, is assured for rates 
less than 0.67, and reliable communication, based on the 
probability of error of selection, is guaranteed for rates less 
than 0.41. 

V. Multivariate Interpolation 

In this section, we estimate the decoding radius of ASD for a 
new class of codes introduced recently by Parvaresh and Vardy 
in [14]. These codes are constructed as evaluations of M > 2 
polynomials. A multivariate interpolation decoding algorithm 
for the codes in [14] is shown to exceed the GS decoding 
radius for low values of the code rate R. In this section we 
extend our analysis of ASD to multivariate interpolation. 




Code Rate (R) 



Fig. 4. Trivariate decoding of a PV code compared to ASD of a RS code. 



A code C in the Parvaresh-Vardy (PV) family is defined 
as follows. Let {1, j3%, . . . , /3m -1} be a basis of V q M over 
F q , let {ai, et2, . . . , clm-i} be a set of positive integers greater 
than 1, and let e(X) be an irreducible polynomial over ¥ q . 
Given a message u, the encoder constructs f(X) as the 
polynomial derived from it and finds the set of polynomials 
{5iW,S2pO,- ■ .,g M -i{X)} by computing 



9i {X) = (f(X)) a * mod e(X). 



(15) 



A codeword c = {ci, C2, ■ ■ • , Cn} of a PV code that is 
associated with u is found through the evaluation 



A/-1 



<H = f{xi) + 0jgj(xi), Vi : 1 < i < n. 
i=i 

It follows that the rate of a PV code C is R = k/Mn 
and the minimum distance is d = n — + Since (T5[ is a 
non-linear operation, the code C is not necessarily linear. 

A. Soft-Decision Decoding of PV Codes 

Although Parvaresh and Vardy only considered hard- 
decision decoding in [14], soft-decision decoding of Folded 
RS Codes, a broader class of codes that contains PV codes, 
was considered by Guruswami and Rudra in [6]. Remark 4 
of [6] includes a condition for list-decoding success. In the 
next theorem we present a more stringent condition for list- 
decoding success of a PV code transmission. 

Theorem 5: Let C be an [n, k] PV code over ¥ q M com- 
municated over a discrete, memoryless channel with additive 
noise. Suppose that it is decoded using a multivariate version 
of the ASD algorithm. A codeword c — (ci, . . . , c„) will be 
included in the list output by the algorithm if 



i=i 



Z c(i), 



> M + l 



M)"E 



1,3 



m 



1,3 



M 



where niij is an element of the q x n multiplicity matrix 
M. 

Proof: By extending equation (38) of [13], we can derive 
an upper bound for the weighted degree of the multivariate 
polynomial as 



wdeg Q(X,Y U ...,Y M ) 



< 



\ 



M 



(fc-i) M £]> 



i) 



i.j 1=0 



(16) 



where wdeg X % Y{ 



V3M 
- 1 M 



(k — 1) J2bL 1 31- If the score 



Sm ( c ) exceeds the RHS of ( TToT l. then c is on the algebraic 
soft-decision decoder's list by an argument similar to the one 
employed to prove Lemma [2] □ 

B. Multivariate Error Decoding Radius 

Suppose the PV code is transmitted over a channel with 
transition probabilities {p\,P2, ■ ■ ■ ,P q ^'}- The statistics p m i n , 
Pmaxi an d 7 are defined as before over this new set of transition 
probabilities. An error radius is given in Theorem [6] for soft- 
decision decoding of PV codes. 

Theorem 6: Given a PV code with rate R = k /Mn is used 
to communicate over an additive-noise channel. If 



f Pmax 
- < 

n 



M+il R M M M 



M+iy. 2^i=i ll;=olP' + l l A ) _ a 



(17) 

then an algebraic soft-decision decoder, with complexity factor 
A, will produce a list that contains the transmitted codeword 
c. 

The proof is very similar to the proof of Theorem [TJ and is 
omitted. The multivariate ASD error radius is larger than the 
bivariate ASD error radius for low-rate codes. This claim is 
shown through an example. 

Example 6: Let us return to the typewriter channel, p max = 
0.8, Pmin = 0.2, 7 — 0.68, and A = 100, and compare 



trivariate soft-decision decoding of PV codes to bivariate soft- 
decision decoding of RS codes (in other words, ASD). Figure 
4 shows the error radii Q and (IPTl) . The graph shows that 
trivariate decoding provides an improvement over the bivariate 
one for rates less than 0.3. 

VI. Conclusion 

The results presented in this paper have shown that soft- 
decision algebraic list decoding of RS and related codes and is 
able to outperform its hard-decision counterparts for low-rate 
to medium-rate codes. An estimate of the error decoding radius 
derived in the paper enables ASD to be compared for the 
first time to other RS decoding methods. This result has also 
been extended to multivariable RS codes. A better estimate of 
the error probability for list decoding under ASD is derived 
which shows that at lower rates, the list decoding error is 
not an adequate performance criterion for this algorithm. A 
comprehensive probability of error bound is also derived that 
includes the previously overlooked probability of selection 
error. 

An open question that remains unanswered is if ASD's 
performance makes it a worthwhile decoder to use in Reed- 
Solomon coding applications. For low-rate coding applications 
with channels that are far from q-ary symmetric, ASD shows 
the potential to correct a greater number of errors than hard- 
decision decoders. However, an interesting (and important in 
applications) fact of ASD decoding outperforming its hard- 
decision counterparts for high-rate codes claimed in some ex- 
perimental studies, so far has not been confirmed by theoretical 
analysis. 
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